CMPT 215.3 FINAL EXAMINATION 


Instructions 

Read each question carefully and write your answer legibly on the examination paper. No other 
paper will be accepted. You may use the backs of pages for rough work but all final answers must 
be in the spaces provided. The marks for each question are as indicated. Allocate your time 
accordingly. 

Ensure that your name AND student number are clearly written on the examination paper and that 
your name is on every page. 

Note: a reference table of MIPS instructions is provided at the end of the examination paper. 
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1. General (10 marks in total - 1 mark for each part) Give the technical term that best fits each of 
the following descriptions or definitions. 


(a) An operation that tells the assembler how to translate a program but does not produce machine 
language instructions; always begins with a period. : 

(b) The field in the IEEE 754 standard for floating point number representation that stores the 

digits of the fractional part. J 

(c) A cache replacement scheme in which the item replaced is the one that has been unused for 

the longest time. _ (4 Ua<>) 


(d) A technique that uses main memory as a “cache” for programs stored on disk. 

(e) An event that unexpectedly changes the normal flow of instruction execution; as, for example, 
when a program tries to perform an invalid operation (such as a “Iw” to a non-word address), 
and the operating system must be invoked. / 

(f) An advanced pipelining technique that enables the processor to fetch and execute more than 

one instruction per clock cycle. j 

osrSc a 



(g) A type of digital logic circuit without “memory 1 '; the output is a function only of the current 

input .. / ' 4,jtod 

(h) In an assembly language program, a label referring to an object that can be referenced from 

files other than the one in which it is defined. . / i i\ 

(i) A technique for dealing with data hazards, wherein a result value from a preceding instruction 

is provided directly to the instruction that needs it, rather than waiting for the result to be 
written to memory or to the register file. *. 

(j) A field in each entry in a processor cache that contains the address information required to 
determine whether the block stored there is the one being searched for. 
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2. Computer Performance (16 marks in total) 

(a) (2 marks) List two different types of benchmarks, 
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(b) (2 marks) For a particular program as run on a particular system, the number of machine 
language instructions executed is 200 x 10, the clock rate is 500 MHz, and the CPU 
execution time is 80 seconds, What is the CPI? 
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(c) (6 warfcsr) For each of the following parts, suppose that you have obtained some 
measurements of the performance of a system before and after a system change was made. 
Your task is to determine what type of change was made, among the following possibilities: 
coding changes to the high level language program being tested, or new processor with the 
same instruction set architecture, or new processor with a different instruction set architecture. 
In each part you are to assume that only one of these types of change has been made, so you 
must indicate only one of these changes for each part mget any credit. 
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(i) Suppose that the MIPS value for the measured program has increased by 50% and 

the CPU execution time has decreased by 33%, What is the most likely type of 

system change that was made, from the possibilities given in the above list? 
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(ii) Suppose instead that the MIPS value for the measured program has increased by 

25%, the CPU execution time has decreased by 5%, and the CPI has decreased by 
20%, What is now the most likely type of system change that was made, from the 
possibilities given in the above list? 
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(iii) Suppose instead that the MIPS value for the measured program has increased by 
25%, the CPU execution time has decreased by 20%, and the CPI has decreased by 
10%. What is now the most likely type of system change that was made, from the 
possibilities given in the above list? 
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(d) (2 marks) Would you expect a CISC or a RISC processor to have a higher MIPS rating, 
assuming similar age (year of design), design effort, and cost? 


(e) (2 marks) Suppose that the following CPIs and frequencies of usages have been determined 
for the four instruction classes of a particular processor. By what percentage would the MIPS 
rating increase if a hardware improvement was implemented that decreased the CPI for 
instructions of class D from 5 to 2 (without changing anything else)? 
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(f) (2 marks) Consider a particular program that spends 20% of its execution time on floating 
point instructions. Write an expression for the percentage by which total execution time 
would be decreased, if the time it takes to execute each floating point instruction was 
decreased by p %. 
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3. Arithmetic (16 marks in total) 
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(a) (3 marks) In IEEE 754 floating point format, what are the special meanings of exponent field 
values of 00.. .0 and M... I ? 
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(b) (3 marks) Give a truth table for a logic function whose 3 inputs are the binary digits of a 3 bit 
signed integer* and whose 3 outputs give the binary' digits of -x. Assume signed integers are 
represented using 2’s complement, and ignore overflow. 
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(c ) (8 marks) Give the base 10 number that is represented by 110010, assuming each of the 
following representations: 


(i) 6 bit 2’s complement 

00 U O' 

f 

00 I' ' 0 

(ii) 6 bit biased notation with bias of 31 

/1 OCS l $ 

^ SO 
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(iii) 6 bit l’scomplement 

11 00 >0 
001*01 
$ H 1 * - ‘3 





(iv) 6 bit sign-magnitude 
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(d) (2 tffczrfo) Recall that in the IEEE 754 floating point standard, single precision floating point 
numbers have a 1 bit sign field, followed by an 8 bit exponent field (in biased notation with a 
bias of 127), followed by a 23 bit field giving the digits of the fractional part. What decimal 
number is represented by 1100000011 OH 0000000000000000000 ? 
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4. Machine and Assembly Language (20 marks in total) 

(a) (2 marks) Describe what is done within each pass of a 2 pass assembler. 

^ A Qi i a 


(b) (2 marks) Suppose MIPS had 64 general-purpose registers rather than just 32. What impact 
would this have on the design of MIPS machine language? 
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(c) (4 marks) Write a well-structured fragment of C or Java or Eiffel code that accomplishes the 
same task as the following sequence of MIPS assembly language instructions. Use the 
variable “i” for register $sl, ‘V’ for register $s2, and an integer array fct A” (indexed from 0 to 
n-1) for the sequence of words whose starting memory address is initially in Ss3. Your code 
should result in the same final assignment to ‘T, as is made to register Ssl. 


add 

Ssl,$s2,$zero 

L: beq 

$sl,$zero,out 

lw 

$tl,0($s3) 

beq 

Stl,Szero,out 

addiu 

Ss3,Ss3,4 

addi 

Ssl,$sl,-1 

j 

L 

out: add 

Ssl,Ssl,Ssl 
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(d) (12 marks ) Write a MIPS procedure that “reverses" an array. Specifically, your procedure 
should take two parameters. The first parameter (passed in SaO) is the starting memory 
address of an array of words, The second parameter (passed in $al) is the number of items in 
the array. Your procedure should reverse the array by exchanging its first and last items, then 
the second and second last items, and so on. For example, if your procedure is passed the 
address of the array 1 8 3 5 3 0, with the number of items being 6 in this case, your procedure 
should transform this array into 0 3 5 3 8 1. 
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5. Datapath and Control (20 marks in total) 


(a) (4 marks) Determine the total number of clock cycles required to execute the program 
segment given below using a 5-stage pipeline without forwarding. The five stages and their 
functions are given as follows: 


• IF 

• ID 

• EX 

• MEM 

• WB 


- instruction fetch (from instruction memory) 

- instruction decode and register file read 

- execute or address calculation 

- memory access (from/to data memory) 

- write back to register file 


Assume that if one instruction reads a register during the same clock cycle as another 
instruction is writing it, the new value will not be read. Note that an instruction takes 5 clock 
cycles to complete (if not stalled). Do not reorder the instructiqps* Instructions are fetched 
and executed exactly in the order given below. 


b 


a 





Iw 

Ssl,0{Ss2) 


Iw 

Stl ,0($sl) 

—-■ 

addi 

Ss3,Ssl,4 


addi 

Ssl.Stl1 


add 

StO,Ssl,Ssl 



(b) (4 marks) Repeat part (a), but now assuming that the pipeline does use forwarding, and that if 
one instruction reads a register during the same clock cycle as another instruction is writing it, 
the new value will be read. 
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(c) (4 marks ) Consider the single-cycle, multicycle, and pipelined (with hazard detection, 
forwarding, and branch prediction) implementations of a MIPS subset discussed in class. 


(i) 

(ii) 

(iii) 

(iv) 


With which of these implementations would you expect to have the highest CPI? 
With which would you expect to have the second highest CPI? 

With which would you expect to be able to achieve the highest clock rate? 



or 

With which would you expect to have the lowest clock rate? 
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(d) (4 marks) fn what ways might compiler techniques for dealing with hazards be preferable to 
hardware approaches? In what ways might hardware approaches be preferable? 










— Sfr*yaUs- “TU 

. i. A y-S 




Q,<ilty Iua-z £**(>£) 


(e) (4 marks) Describe how dynamic branch prediction can be implemented, and 1-bit and 2-bit 
dynamic branch prediction schemes. p b ^JJn& 
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6, Cache and Virtual Memory (18 marks in total) 

(a) (4 marks ) Consider a cache with space to store 16 blocks. 

(i) Assuming the cache is set associative with each position of the cache having space 
for 2 blocks, give a formula for the cache position in which memory block number 
N would be stored. Q. h l cc^s - £ * 


(ii) 


Assuming instead that the cache is direct mapped, and there are 8 words in a block, 
which cache position would be checked on a reference to (byte) address 1400? 
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(b) (4 marks ) State tour of the benefits provided by the virtual memory technique. 
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(c) (2 marks) Suppose that in some system utilizing paged virtual memory, a physical page 
number is 12 bits, a virtual page number is 48 bits, and a virtual address is 64 bits. What is 
the maximum amount of physical memory (in Mbytes) that this system could have? 
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(d) (4 marks) Outline the steps by which a virtual memory address is translated into a physical 
memory' address, in a computer system using the virtual memory' technique. 
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(e) (4 marks) Define temporal and spatial locality, and for each type of locality, give an example 
of how it is exploited in the management of the memory hierarchy. 



The End 



